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1 Introduction. 

In this paper we will use the following notation. Given a set of positive 
integers S, we let S{x) denote the number of elements in S that are < x, 
and we let 15*1 denote the total number of elements of S. Given two sets of 
positive integers A and B, we denote the sumset {a + b : a E A,b E B} 
hj A + B; and so, the number of elements in A + B that are < x will 
be {A + B){x). For a finite set of integers J, and integers q > 2 and r, 
we let J{r,q) denote the set of elements of J which are = r (mod q). We 
will also use Vinogradov's notation: The statements "/(x) <^ g{x)" and 
"(7(x) ^ f{x)" are both equivalent to "/(x) = 0{g{x))"; and, we will use 
"/(x) g{x)" to indicate that the implied constant in the big-oh depends 
on a parameter y. Finally, by the statement f{x) ~ g{x) we mean that 

ii,„ m = 1. 



g[x) 



An old conjecture of Ostmann, which is sometimes called the 'Inverse 
Goldbach Problem', asks whether there is an additive decomposition of the 



primes, with at most finitely many exceptions (see [13|, p. 13); that is, do 
there exist sets of positive integers A and B, each with at least two elements, 
such that 

for n > Xo (for some xo), n E A + B <^=^ n is prime. 
Even though this question has withstood attack by several mathemati- 



cians, there has been much recent progress. For instance, Wirsing in |T6| 



Pomerance, Sarkozy and Stewart in [14], Hofmann and Wo Ike in and 



1 



Bshouty and Bshouty in []T| have shown that if such a decomposition exists, 
then one has the following estimates on the counting functions 

^ < A{x)B{x) < X, 



logx 

Elsholtz has shown in ISll that 



log X 



■C mm{A{x) , B (x)) < max{A{x) , B {x)) <^ y^log^x, 



which improves on another result of Hofmann and Wolke from 0. Further- 
more, Elsholtz uses these results to show that B (or A) cannot be the sum of 
two other sets, each with at least two elements; that is, Elsholtz has solved 
a ternary analogue of the above conjecture of Ostmann. 

It seems conceivable that sieve methods alone will not solve the Ostmann 
problem, but that some additional insight into the structure of sumsets is 
needed. This paper is a step towards this. 

In his proof of the above mentioned result, Elsholtz makes strong use of 
the fact that 

iA + B + C)ix) > 

logx 

This leads one to wonder whether this constraint can be weakened somewhat. 
To be more specific: 

Does there exist k > 1 and sets of positive integers A,B,C, each 
with at least two elements, such that A + i? + C is a set of primes 
with 

{A + B + C)ix) » -4-? (1) 

log X 

And the answer to this question is: Yes. The Hardy-Littlewood conjecture 
can be used to give solutions for every k > 3. Before we show how, we give 
here the form of the conjecture we will need (see §): 

Hardy-Littlewood Conjecture: Suppose that ai < a2 < ■ ■ ■ < a/c is a 
sequence of integers such that the polynomial (x + ai) (x + 02) ■ ■ ■ (x + a^) has 
no fixed prime divisors. Then, 

X 

^{n < X : n + ai, n + 02, + Ofc are all prime} ~ C{ai,...,ak 



log X 
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where C(ai, a^) is some constant which depends only on ai, a^. 



Now, suppose that A = B = {1,7}, and let C be the set of all positive 
integers n such that n + 2, n + 8, n + 14 are all prime. Then, A + B + C 
consists entirely of primes, and since (x + 2)(x + 8)(x + 14) has no fixed prime 
divisors, assuming the Hardy-Littlewood conjecture we get that 

Cix) ~ C(2,8,14)- ^ 



log X 
and so, 

{A + B + C){x) » 

log X 

which means that our question above has an affirmitive answer for all k, > 3. 

So, if we are to have any hope of extending Elsholtz's work to show that 
there are no triples A,B,C where A + B + C contains many primes (but not 
almost all primes), we have to account for the above "obstruction" arising 
from the Hardy-Littlewood conjecture. The following are all the cases where 
one can apply the Hardy-Littlewood conjecture to construct sets A, B, C such 
that (0) holds: 

Either \A + B\< k, or \A + C\ < k, or \B + C\ < k. (2) 
This now leads us to the following general conjecture: 

Conjecture 1. Suppose A, B, C are sets of positive integers with at least two 
elements each, such that A + B + C consists entirely of primes. If 

{A + B + C){x) 



log'^x 
then holds. 

In this paper we do not quite prove this conjecture, although we believe 
that it is true. One additional, technical assumption about the sets A, B, C 
is needed for our proof; basically, we need that there are not "too many" 
primes p which have "too many" solutions p = a + b + c, aEA,bEB,cEC. 
To state this technical assumption, we need the following definition: 

Definition. For a given collection of sets Ai, A2, ■■■,Ak let r{n] Ai, Ak) 
denote the number of solutions to 

n = ai + ■ ■ ■ + Ok, Oi & Ai for all i = 1,2, 
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We say that the collection of sets Ai, ...,Ak is regular if and only if for every 
e > 0, there exists D > such that for x sufficiently large, 

Y,r{n;Ai,...,Ak) < eAi{x) ■ ■ ■ Ak{x) , (3) 

n£S 

where 

S = {neAi + --- + Ak : n <x, r{n]Ai,...,Ak) >\og^ x}. (4) 



One easy consequence of the fact that Ai, Ak is regular is the following: 

Lemma 1. Suppose Ai, ...,Ak is regular. Then, there exists a constant E > 
such that for x sufficiently large, 

Ai{x)A2{x)---Ak{x) < (log^x) {Ai + --- + Ak){x). 

A proof of this lemma can be found in Section ^ 

Our Main Theorem is as follows: 

Theorem 1. (Main Theorem) Conjecture [7| holds if we assume A,B,C 
is a regular triple; that is, if A, B,C is a regular triple of sets of positive 
integers such that A + B + C is a set of primes, and 

\A\, \B\, \C\>2, and {A + B + C){x) > j^^, (5) 
then holds. 

Note: The conclusion of this theorem can possibly be proved under a weaker 
notion of regularity: One can maybe replace the "log^ x" in (|) with 
"exp(log^~°*^^'' x)", and still have the theorem hold. This would require sub- 
stantial modifications of many parts of the argument, including Propositions 
^ and ^, and Corollary |I[ 

Perhaps the most interesting feature of the theorem is the many different 
ingredients which are used to prove it (it looks like a problem tailor-made 
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for a single application of some sieve method), and the way they all fit to- 
gether. These include: the Large Sieve, Brun's Sive, Gallagher's Sieve, the 
"probabilistic method" and regularity principles (which are used to prove 
Proposition |^), translation invariant principles (which appear in Proposi- 
tion!^ and Lemma |lOD, and certain "maximality" or "local-global" principles 
(which appear in the proof of Lemma ^). 

The basic idea of the proof (of Theorem |l|) is as follows: We will prove 
the contrapositive of the Theorem by first assuming that (Q) fails to hold. 
Through a combination of Propositions || and |^ (which appear in the next 
section of the paper) and some basic combinatorial arguments, we will find 
subsets of A n [1, x], i? n [1, x] and C fl [1, x], call these subsets A, 13 and C, 
which will have certain usable properties. At this point, the proof will break 
down into two cases, with case 1 being where min(|y4|, \B\) > k and case 2 
where this min is < k. The most difficult and important case will be case 2; 
and for this case, we will construct subsets of A, B and C*, call these subsets 
S,L* and C*, such that the following inequalities hold: 



log X 



for some > 0, and 



log^+"x 



< |C*| < v^log^x. 



Then, we will show that most triples (a, 6, c) G S x L* x C* have the 
property that, for any integer r > 1 and some integer k (and x sufficiently 
large), the numbers 

a + b + c + k, a + b + c + 2k, a + b + c + rk 

have very few prime divisors in certain "long intervals". This result will 
follow by showing that the sets L* and C* are approximately "locally trans- 
lation invariant" , meaning that for 'many' primes p < a/x, the residue classes 
modulo p occupied by L* will be almost exactly the same as those occupied 
by L* + k, L* + 2k, and L* + rk; the same will hold for C*. The method 
used to prove this will involve combining very precise arithmetic information 
about the sets L* and C* together with a variant of Gallagher's Larger Sieve, 
and will be the subject of Lemma ^ within the proof of Proposition ^ 
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Next, using Brun's upper bound sieve, we will show that the number of 
integers n < x where n + k,n + k,...,n + rk all have such few prime divisors 
in these "long intervals" is <^ x log"'^''^ x; and so, from this and we will 
deduce 

{A + B + C){x) < l^ + L* + C*|log^x < xlog-'/^+^x = o(xlog-"x), 

for r > 2{k, + E). This will contradict the hypothesis of Theorem ^ and so 
the Theorem will follow. 

Although we indicated earlier how the Hardy-Littlewood conjecture can 
be used to produce sets A, B, C which satisfy the hypotheses and conclusion 
of this Theorem for k > 3, we can give a weaker, unconditional result. Be- 
sides the sharpness of the inequalities obtained through the Hardy-Littlewood 
conjecture, this result is also weaker in that it only holds for a fixed x. 

Theorem 2. Given integers 1 < ki < K2, for all sufficiently large x there 
exist sets of positive integers A,B,C C {1,2, ...,x}, with 

\A\ = Ki, \B\ = K2, \C\ > \B\ > \A\, 

such that A + B + C consists entirely of primes, and 

\A + B + C\>c, ^ 



where 0^1,^2 ^-5 some constant depending only on ni and K2. 



2 Proof of the Main Theorem (Theorem [J). 

We will prove the contrapositive of this theorem. So, let us suppose that 
fails to hold; that is, 

\A + B\, \A + C\, \B + C\ > K. 

In our proof, we will first require a result that is a slight generalization 
of a result of C. Elsholtz 0, as well as a result which allows us to extract 
subsets 

i C A n [1, x], 13 C Bn[l,x], and (7 C C n [1, x], 
such that A + B + C C (a/x, 2x). These first two Propositions are as follows: 
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Proposition 1. If F,G C {1,2, with I < 5 < \F\ < \G\, such that 

F + G consists entirely of primes in (y^, 2a;), andif\F\\G\ ^ x/\og^ x, then 

^ < < \G\ < v^log'x. 



Note: The constant 6 can certainly be improved liere, but sucli an improve- 
ment does not mucli affect tlie quality of the main result in this paper. 

Proposition 2. // A, B,C is a regular triple, then there exist subsets A C 
An[l,x], B <Z B n [l,x], C CCn [l,x], such that 

\A\ ~ A{x), \B\ ~ B{x), and l^l ~ G{x), (7) 

where 

\A + B + C\ ~ (v4 + 5 + C)(x) > -4— , 

log X 

and A + B + C C {^/x, oo) . 

For X sufficiently large, we may assume that 

\B\, \C\ > 2, and \A + B\, \A + B\, \B + C\ > k. (8) 

The first inequality holds since 

\A\ ~ A{x) > 2, \B\ ~ B{x) > 2, and l^l ~ G{x) > 2; 

and the second inequaltiy holds for similar reasons. 

For a given x, suppose that, without loss of generality, 

A{x), B{x) < G{x). 

Consider the two sets A + B and C, and let F be the set with the smaller 
number of elements, and G be the set with the larger number of elements. We 
will show that these two sets F and G satisfy the hypotheses of Proposition 
m, and we will use the conclusion of this proposition to show that |A + i?| is 
"large" , which will be important in later arguments. 

We first claim that \G\ > \F\ > k for x sufficiently large: To see this, we 
note that 

\Cf^C{xf > A{x)B{x)C{x) ^\A\\B\\C\ 

> li + E + CI > -4— , 
~ log^x 
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Thus, \C\ > K for X sufficiently large; and, |A + _B| > k, by (^. It follows 
then that \G\ > k for x sufficiently large. 

We also have that F + G C {^/x, oo), since A, 13, C satisfy the conclusion 
to Proposition ^. Thus, F and G satisfy the hypotheses, and therefore the 
conclusion, of Proposition |l] with 6 = n. Thus, 

< |i + 51,1^1 < log' X. (9) 



log'+" X 

Between the sets A and 5, let S* be the one with the smaller number of 
elements, and let L be the set with the larger number of elements. We now 
distinguish two cases: Case 1 is where l^l > k, and Case 2 is where \S\ < k. 

To prove (the contrapositive of) the Main Theorem in Case 1, we consider 
the two sets L + C and S, and let F be the set with the smaller number of 
elements, and G be the one with the larger number of elements. (Note: The 
sets F and G have now changed from how we defined them before.) These 
sets F and G satisfy the hypotheses of Proposition |^ with 6 = k, since 

1^1 > K and \L + C\> \L\ > \S\ > k =^ \G\ > \F\ > k, 

and since F and G satisfy the other hypotheses of the Proposition. As in 
(1^), we deduce from this that 

< IL + CU^I < v^log'x. 



log'+"x 
From this and (P) we deduce 



A{x),B{x),G{x) > 



log'+"x' 

Now, since A, B,C is a regular triple, this bound and Lemma |l| give 

{A + B + C){x) > 

which is absurd. 

We now consider Case 2, which is where 15*1 < k. For this case we will 
have from (|9|) that 

< 1^1,1(71 < log' x. (10) 



log^+^x 

We need the following Proposition to find subsets of L and C with usable 
properties. 
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Proposition 3. There exists a constant D > such that if x is sufficiently 
large, and if \S\ < k, (Case 2), then there exist subsets L* G L and C* G C 
with 

log x log X 

such that 

\L*\ \C*\ < 2\L* + C*\. (12) 
Let Si,S2 G S, with S2 > Si, be any two integers, set k = S2 — Si, and let 
L* = L* + Si = {i + si : ieL*}. 

Then, 

L* + C* = {i + c + si : (£,c)eL*xC*} 
consists entirely of primes, and so does 

L* + C* + k. 

We will need the following Proposition and its Corollary to unlock the struc- 
ture of the set L* + C*: 



Proposition 4. Let 



Q = max(|L#|,|C*|) > 



log'^+^+^x' 

by (|7i[) and ([7^/ Then, for any integer j > 1 we will have 



^(logp) #{(£,c) e L# X C* : i + c + jk = (modp)} 

P<Q 

= 0{j \L*\ \C*\ loglogx). 



Corollary 1. There exists a constant c > such that all but at most |L* + 
C*\/2 of the elements n e L* + C* have 

r 

logp < cr^ log log (13) 

j = l p<Q 

p\n-{-jk 
p prime 
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One more lemma will establish the Main Theorem: 



Lemma 2. For x sufficiently large, there are at most x\og ''^'^ x integers 
n < X which satisfy (OT 



We have from Proposition Corollary |T], Lemma and Proposition 
that for r = AD + 2k + 2 and x sufficiently large, 

{A + B + C){x) 

< 2\S + L + C\ [Prop. I 

< 2k \L\ \C\ 

< 2k\L*\\C*\ log^^x [Prop. I 

< 4/€ |L* + C*|log^^x 
= 4/? |L# + C*|log2^a; 

< 8k (log^^x) #{n eL* + C* : n satisfies ([l|)} [Cor. | 

< 8/t (log^^x) #{n <3x : n satisfies (|T|)} 

< 8k (log^^x^ 



log^^+«+^(3x) 



1 K+1/2 [Lemma |]. 
log ' X 

which contradicts (||), and so the theorem is proved. 



3 Proof of Theorem g. 



The proof is based on a twofold application of a counting argument due 
to Erdos, Stewart and Tijdeman compare also Lemma 6 in Pomerance, 



Sarkozy and Stewart 



Lemma 3. Let r be a positive integer. Let x > Xr be a sufficiently large 
positive integer and let T be a non-empty subset of {1, . . . ,x}. Then there 
exists a set S G T and a set of non-negative integers A such that A-\- S G T, 
and 

1^1 1^1=^- 
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Since we want to prescribe the number of elements of two sets A and B 
we apply this lemma once again to the set S. This gives 

Corollary 2. Let ki, k,2 denote positive integers. Let x > Xki,k2 be a suffi- 
ciently large positive integer and let T be a non-empty subset of {1, . . . ,x}. 
Let 

o 



R 



Then there exists a subset C G T and sets of non-negative integers A, B such 
that 

A + B + CcT, \C\> \A\ = \B\ = K2. 

It is obvious that one could iterate this argument. We resist doing this 
since we concentrate on ternary problems. 

Now let T denote the set of primes in [l,x]. Recall that by the prime 
number theorem with error term (see |]10[, §54) 

T = ; + - TTT + O ' 



logx (logx)^ \(logx) 

For large x we have that |T| — ki > ^^^^ Hence it follows (as in the proof of 
theorem 6 in [|l^) that 

^ ^ (toj^ + 2(1^) ^ X ^ X 

^^Tvvx'^i-i ~ fi;i(logx)''i 2(logx)''i+i ^ ' 

For the second application of the argument we observe that for large x we 
have R — K2> — ^ ■ The argument then gives: 



1^1 ~> '^2! y/tiCiogx)"! y _ X 
Our theorem now follows since \C\ < \A -\- B -\- C\. 
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4 Statements and Proofs of Some Technical 
Lemmas. 



We will need the following three sieve lemmas, and their various corollaries: 
the Large Sieve of Montgomery (see 0]), Brun's Upper Bound Sieve (see 
1^), and a variant of Gallagher's Sieve (see 0): 

Lemma 4. (Montgomery's Sieve) Given a set of integers J O {1,2, 

and for each prime p < x, let uj{p) be the number of progressions modulo p 

which J fails to occupy. Then, 



^q<Q f^'^il) Ylp\q p-J(p) 



One has the following corollary, which essentially appears in Vaughan's 
paper 

Corollary 3. For J and a;(p) as above, and T < y/x, we have 

2x 

— ~7 , , \ m ! 

ym l^V<T p J 

where m = [(log x) / (2 log T) J . 

(Note: In Vaughan's paper he proves this result with the factor 4 on the left 
hand side, instead of the factor 2. The reason is that he used an earlier, 
weaker form of the Large Sieve.) 

Lemma 5. (Brun's Sieve) Suppose that J C {1,2, ...,x} is the largest such 
set of integers which fails to occupy uj{p) < B progressions modulo p, for 
each prime p < z. Then, 

1,1 «,.n(i-^)- 

p<z ^ ^ 

Lemma 6. (Gallagher's Sieve) Suppose that J C {1,2, and \ J\ > 

U. Then, 

p-i 

|J|^(logx + 0(l)) > 5^ (logp)5^|J(c,j9)|l 

P<u c=0 

p prime 
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A corollary of this sieve which we will need is the following: 

Corollary 4. Suppose J is as in Lemma and let h{p) denote the number 
of residue classes modulo p occupied by J, for each p < U < \J\- Then, 



1 

logx + 0(l) > --^(logp) J] I J(c,p)p 

I I p<U c=0 



p<U 



logp 



We will also need the following inequality of Cauchy and Davenport (see 

my. 



Lemma 7. (Cauchy-Davenport Inequality) For sets G and H , let hi, /12 

and h^ denote the number of residue classes modulo p occupied by G, H and 
G + H , respectively. Then, 

h^ > mm{hi + h2 - l,p). 



Finally, we will also need the following simple consequence of the Cauchy- 
Schwarz inequality: 

Lemma 8. Suppose that J is a set of integers which occupies at most k 
progressions modulo m. Then, 

J2\Jia,m)\' > ^. 

a=0 

To prove this lemma, let 6 (a) be 1 if a is in one of the progressions occu- 
pied by J (there are at most k such progressions), and let it be otherwise. 
Then, the Lemma follows quickly from the Cauchy-Schwarz inequality: 

m— 1 /m— 1 \ /m— 1 

\2 



a=0 \a=0 / \a=0 

> 5^|J(a,m)|5(a) = \J\'. 



,a=0 
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We now prove those of the above lemmas which cannot be found in the 
literature, as well as Lemmas |l| and ^ 

Proof of Lemma |I]. To prove this lemma we let e = 1/2, and let -D > 
and S be as in the definition of regular sets. Further, let 

T = {ne Ai-\ hAk : n < x, r{n; Ai, Ak) < log^ x}. 

Then, 

Ai{x) ■ ■ ■ Akix) = ^r(n;Ai,...,Afc) 



ngT 



+ ^r(n;Ai,...,Afc) 



< {hg'' x)T{x) + -Ai{x) ■ ■ ■ Ak{x) 



< (log^x) + + 

+^Ai{x)---Ak{x). 

Rearranging terms gives 

Ai{x) ■ ■ ■ Ak{x) < 2{\og^ x) {Ai + ■ ■ ■ + Ak)ix); 
and so, the conclusion of the Lemma holds with E = D + 1. 

Proof of Lemma |2|. We note that if n satisfies (|13D then the largest prime 

2 

divisor of each of the numbers n + k,n + 2k, ...,n + rk is < log'^'^ x. Thus, 

2 

for each prime p G [log'^^ x,x), we must have that 

n ^ —k, —2k, —rk (modp). 

Thus, the number of progressions which n can lie in modulo p, for each such 
p, is h{p) < p — r. From Brun's Sieve, we get that the number of integers n 
satisfying (^) is 

/ \ 



•Cr X 



n 



1 ^ 

p 

log*^^ x<p<Q 
p prime 



< X exp 



E - 



\ 



log*^^ x<p<Q 
p prime 



X exp( — r(loglogx — O(logloglogx)) 

(16) 



X 



loff'-/^: 
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which proves the Lemma. 

Proof of Lemma ^. We have that for any pair of integers ji,j2 G J, 
\ji — j2\ < X, and so 

^ \ogp <log|ji-j2| < logX. 



p prime 



Summing over all pairs ji,j2 (of which there are at most | Jp), we get 



iJl^loga; > 5^ 5^ lo, 



il#j2 P prime 



p-1 

> Y^{'^Ogp)Y^i^{]i,]2eJ : ]IT^]2, 
p<U c=0 

ji=j2 = c (modp)} 

p<U \ \c=0 / / 

Using the fact that 

J]logp = 0{U) = 0(1 J|), 

p<U 

and rearranging terms in the above string of inequalities, we get 

p-i 

|J|2(logx + 0(l)) > J](logp)E I J(c,p)p, 

p<U c=0 

as claimed. 

Proof of Corollary ^. 

Since J occupies h{p) progressions modulo p, we have from Lemma || that 



,2 > i 
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Putting this into Lemma 0, we get 



I I p<u c=0 p<U 

p prime 



as claimed. 



5 Proof of Proposition J]. 

Let T=[S\ + 1. Then, we have |G| > |F| > r. 

The proof involves four iterations: In the first iteration we will show that 
|G| <^ a;(logx)~'^^°'^^-*, and thus |F| (loga;)'^"'^""*-"^''; in the second iteration, 
we will show that \F\ ^ exp((logx)'^~''); in the third iteration, we will show 
that \F\ > x^/^, for x sufficiently large; and, in the final iteration, we will 
show that 

-^<^\F\<\G\<^x'/Hog'x. 
log 

We note that our Proposition can be proved using three iterations (instead 
of four), as was done in 0; also, no attempt was made to optimize the powers 
of the logarithms appearing in the result. 

Throughout the proof we let hi{p) and h2{p) denote the number of residue 
classes occupied by F and G, respectively. Since no element of F + G can 
be divisible by a prime < y/x, we deduce that F + G occupies at most p — 1 
residue classes modulo p for each such prime. So, from Lemma 0, we deduce 

hi{p) + h2{p) < p. (17) 

We let u>{p) = p — h2{p) be the number of progressions which G fails to 
occupy; and so, ([TTI) implies that uj{p) > hi{p). 

For the first iteration, let /i,...,/^ be any r elements of F, and Z be 
the set of primes < y/x with the property that /i, all occupy different 
residue classes modulo p. Let P be the set of primes < ^/x, and set 

/(z) = E- = iogiog^+o(i)- E -• 

P&Z ^ P&P\Z ^ 
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To estimate this last sum, we first define 



P 

p|n 



Then, s{n) <C log log log n, and this upper bound is attained when n is the 
product of the primes < logn. Now, if p & P \ Z, then p\A, where 

i<«<i<T 



and so. 



- < X] - <r logloglogx. 



p ^ — ' p 

peP\z ^ p|A ^ 

Thus, 

f{Z) = log log a; — Or (log log log x). 

Letting n(x) be the product of the primes < ^/x, we deduce from Lemma 
that 

|G| < #{n<x : ((n + /i)(n + /2)---(n + /r),n(a;)) = l} 
•C X exp ( — r - 



X 

< 



log--°(^) 



X 



Thus, since x([ogx)~^ <^ l-^IIG*], we deduce |F| ^ (logx)'^~'^~°*^^\ as claimed. 

For the second iteration, let G F, where t = log^~^~°*-^^ x, and, 

as before, let Z' be the set of primes < ^/x where all the /j's fall into distinct 
residue classes modulo p. Then, as before, let 



x'^l\ 



Then, fi, ft are not distinct modulo p implies p\A'. As before, we have 



- ^ logloglog 



X. 



p ^ — ' p 

peP\Z' ^ p|A' ^ 
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Thus, if we let T = exp(log^~^/^+^/^ x), then 

E- > E - - E - 

p<T ^ p<T ^ peP\Z' 

p£Z' V prime 

= log log T — O (log log log a;). 
Now, applying Corollary |^ with 

m = [(log"/^-^/^ x)/2\ and u{p) = (logx)"-^-"^^) for all p e Z' n [2, T], 
we get 

|G| < #{n<x : ((n + /i)(n + /2)---(n + /i),n(x)) = l} 

X 

< 



f(logx)-/2-V2-o(l)^^^^ i 



m 



exp(2m) exp(log 



t/2-5/2 



X 



Thus, since x/ \og^ x < \F\\G\, we conclude that 



F\ > exp ( (1 -o(l))A/log""^x ) . 



For the third iteration, let T' = exp{^/\og^ ^3;/2) and m' = [^/log^ "^^"^J . 
Then, from Corollary H, we have 



logx + 0(1) > ^ 



logp 

hi(p) 

T'/2<p<T' ^-^^ 



(Note: we use the corollary with J = F, and we have from iteration two that 
\F\ > T' for X sufficiently large). So, for almost all primes p G [T'/2,T'] we 
have that uo{p) > hi{p) > pj log^ x\ and so, 

^ p log X 

T'/2<p<T ^ ^ 

Using Corollary |^ with T = T', we deduce 

\G\ « r < = a:^/2+o(i). 

m' 2^T'/2<p<T' p 
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Thus, since l-FUGI ^ xlog ^ x, we deduce |F| > x^^^ for x sufficiently large. 
For the last iteration, we have by Corollary H that 



logx + 0(l)> ^ 



logp 
hi(p) 



and it follows that almost all primes in [x^^^ /2, x^^^] have uj{p) > hi{p) > 
p/ log^ X. Thus, 



Euj(p) x^^^ 



By Corollary we have 



X X log^ X 



\G\ < < = Vxlog^x; 



\^Z^al/4/2<p<a'l/4 p J 

and so, since x/ \og^ x <^ |F| \G\, we deduce 



< |F| < IGI < Vxlog^x, 



log^+'^ X 
and the Proposition is proved. 



6 Proof of Proposition |2]. 

Since A, B,C is a regular triple of sets, we have from Lemma that for some 
E>0, 

MV^) Bjy^) C(v/i) ^ E ^A + B + C)iV^) 
A{x) B{x) C{x) - ^ ^ ^ {A + B + C){x) 

y/x\og^x log^+^a; 

71 K — ~ 7= — • 1-*-°^ 

X/ log X a/x 

Thus, for X sufficiently large, one of the following inequalities must hold: 



•4(v^) < 9if or Bi-Tx) < ^ or C(v/i) < (19) 
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Suppose that the inequahty holds for A{x) and A{y/x). Then, letting 
A = An {y/x, x], 13 = B n[l,x], and (7 = C n [1, x] 



gives 

\A\ ~ A{x), \B\ ~ B{x), \C\ ~ C{x), and \A + B + C\c (Vx, x]. 
Also, since A,B,C is a regular triple, we get 

< {A + B + C){x)-\A + B + C\ 

< i^{n = a + b + c : a e A,b e B,c e C, a < y/x} 

< A(v^) B{x) C{x) 

< X ~^/^A{x) B{x) C{x) 

< x-^/\log^ x) {A + B + C){x) 



Thus, 



iA + B + C)ix) ~ \A + B + C\ 



as claimed. We get the same conclusions for the remaining cases of (19). 



7 Proof of Proposition |3]. 

Since A, B,C is a. regular triple, one can easily deduce that for e = 1/12 and 
X sufficiently large, there exists E > such that if 15*1 < k, then 

J2 r{n;L,C) < e\L x C\. (20) 

nSLxC 
r{n;L,C)>logE x 

For the remainder of the proof of this Proposition, we will assume that E is 
such that this inequality is satisfied. 

The proof now proceeds using a probabilistic argument: Let L' and C 
be random subsets of L and C, respectively, where 

Prob(£ eL' \ieL) = Prob(c e C \ c e C) = — l^, 

log X 
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where all these probabilities are independent. Clearly, \L'\ and \C'\ each 
have a binomial distribution, which implies that the following occurs with 
probability 1 — o(l): 

^^"^^'^ ^^'^ < \L' X C'\ < 2E{\L' X C'l), (21) 
where E{\L' x C'\) is the usual expectation given by 

E(|L'xC'|)= Prob((£,c)GL'xC") = i^^. 

^ . log X 

In the course of our proof, we will show that the event 

(1 -6e)|L' X C'l < \L' + C'\ and (|21|) occurs (22) 



has positive probability, which will imply that there exists subsets L* G L 
and C* G C satisfying these same inequalities. If we can do this, then (|12D 
will hold (since e = 1/12), and ([Tl|) will hold for D = E + 1 and x sufficiently 
large. 

Thus, the Proposition will follow if we can show that (p^ has positive 
probability. We note that it suffices to prove that 

Prob ( \L' X C'\ - \L' + C'\ < 3eE{L' x C) ) > i, (23) 

since (pT]) holds with probability 1 — o(l). 

We now proceed to show that ( p3l) holds: Suppose that n G L + C has 
exactly k solutions to 

n = ii + ci, £k + Ck, each (£j, q) G L x C. 

Then, since the £j's are distinct, and the Cj's distinct, we have that all subsets 
of the following probabilities are independent 

Prob((£i, ci) e L' X C"), Prob((4, Cfc) e L' x C) ^ 



log X 



It follows then that if we let r'{n) be the random variable 
r'(n) = {{i,c)eL'xC' : n = i + c}, 



21 



then Prob(r'(?2) = d) has a binomial distribution, given by 



Prob(r'(n) = d)=\\\l- < 



^dj V log^^x; log^'^^x rfllog^'^^x' 
and, we have the following easily checked expectation formula 

^/ ,/ NN r(n: L, C) 
E{r'{n)) = ^-^4^, 
log X 

where r(n; L, C) is as defined in the Introduction. 
For bookkeeping purposes, define 

= {n E L + C : n has at most log"^ x solutions to 
n = i + c, {i,c) e L X C}; and 

N = (L + C) \ N. 



and define the random variable 

5{n) = 



0, if n ^ L' + C; 

1, iineL' + C. 



Then, from (^) and the above probability and expectation estimates, we 
have: 

E{\L' X C'\ - \L' + C'\) 

n&L+C 

= J2E{r'{n)-6{n)) + J2E{r'{n)-6{n)) 

neN n&N 

< ^^(rf- 1) Prob(r'(n) = d) + ^ E(r'(n)) 

TiGAf d>2 „g7V 
n^N d>2 \ ' '=> o n£N 



2\LxC\ e|LxC| 



log^-^a; log^^x V log^^x 



Ei\L'xC'\) (^e + 
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Markov's Inequality. If X is a non-negative random variable, then 

Prob(X>a) < 

a 

From this inequality with X = \L' x C'\ — \L' + C'\, together with our 
above expectation estimates, we deduce 

Prob(|L'xC"| - \L' + C'\ > 3eE{\L' X C'D) < ^ 

3 log X 

Therefore, (^) holds for x sufficiently large. 



8 Proof of Proposition ^, Corollary [1], and 
Lemma |9|. 

For a given set of integers J, let Jp denote the set of residue classes modulo 
p occupied by J, and let Jp denote those residue classes not occupied by J. 
Clearly, \Jp\ =p — \Jp\. 

For all integers j we have that (£, c) G L'^xC* is a solution to i+c+jk = 
(mod p) if and only if 

(£, c) = (r, — r — jk) (mod p), for some r G \ (L* — jk)p. 

From this and Cauchy's inequality we have 

Z ■= ^(logp)#{(£,c) G L# X C* : i + c + jk = {mod p)} 

P<Q 

= 5^(logp) \L*ir,p)\\C*i-r-jk,p)\ 

P<Q r(iL*\{L*-jk)p 

< zl"zl'\ 

where 

Z, = 5^(logp) Yl \L*ir,p)\'; and 

P<Q reL*\{L#-jk)p 

Z2 = 5^(logp) Y \C*{~r-jk,p)\\ 

P<Q reL*\{L#-jk)p 

To bound Zi and Z2 from above we will require the following three results: 
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Lemma 9. We have for Q = y^log ^ ' x that 

= log a; + O(loglogx) = and (24) 

p<Q I p<Q I^PI 

= log- + O(loglogx) = E^^- (25) 

Compare this with Elsholtz @]. 

Proposition 5. Suppose that J = L* or C* , and let K he the other set (if 
J = L*, then K = C* , and vice versa). Also, suppose that for each prime 
p < Q we have a set of residue classes Gp C Jp. Then, we have the following 
inequality 

J2i^ogp)J2\Ar,p)\' < l^pfE^^IT^ + O(loglogx) 

P<Q reGp \p<Q ^ ' ^''^ 

Lemma 10. Suppose J = or C* , and that K is the other of the two sets. 
Then, for any integer j > 0, 



_ 1^ 12 = '^(■^ loglogx). 

p<Q P I^pI 



The proofs of these last two results will make use of the following basic 
facts about the sets and C*: Since for every {i,c) G x C* we have 
i + c and £ + c + k are primes > ^/x, there can be no solutions to £ + c = 
(mod p)oTi + c + k = (mod p) for any prime p < Q < \fx. Thus, 

n(-C*)p = = (L# + A;)p n (-C*)p 



=^ L* and (L* + k)p are both subsets of (— C*)p. (26) 

Similarly, 

C* and (C* + k)p are both subsets of {-L*)p. (27) 

Resuming the proof of our Proposition ^ we have from Proposition |^ and 
Lemma ^ with J = and Gp = L* \ (L* — jk)p that 

Zi < l^*r(^E^q^ + 0(loglogx)j = 0{j\L*\' loglogx). 
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Applying these two results with J = C* and Gp = {—{L* \ {L*—ik)p) — jk)^ 
(note: \Gp\ = |L* \ (L* — jk)p\ ), we likewise get 



Thus, 



0{j \C*\' loglogx). 



Z < Z^'Z',/' = 0{j \L*\ \C*\ logloga;), 



which proves the Proposition. 

Proof of Corollary |1]. We have that 



i&L*+C* 



j = l p<Q 
p\n-\-jk 
p prime 



P 



< 



J2 J2 Yl 

(e,c)£L#xC* p'^Q 



j=l p<Q 
r 

= Y^OU \L*\ \C*\ log log x) 

= 0(r2 \L* + C*\ loglogx), 

where this last equality following from Proposition |^. It is now obvious that 
more than half the elements n G + C* satisfy (0), which proves the 
Corollary. 

Proof of Lemma ^. 

Since for p < Q we have L* + C* contains no numbers = (mod p), it 
follows from Lemma ^ that 



and so, 



\L*\ + \C;\ < p; 



111 1 

+ 7T^ > + 



\T*\ \C* 



'#1 



4 

> -. 

p 
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From this inequality we deduce 

p<Q \\-^p\ \^V\'> p<Q VI-^pI P \ 



- \T*\ 



- p 

= 21ogx + 0(loglogx). (28) 
Now, from Corollary ^ with J = and J = C* we deduce 

— ^ < logx + 0(logloga;) and < logs + 0(logloga;). 

p<Q l-^P I P<Q I pl 

Combining these two upper bounds with (pSD, we have that p^) is satisfied; 
and 

"^(^^SP) (—#- + —#-] = 21ogx + O(loglogx). 



This equation and ( p4D together imply that 

p<qP \^p\ p<q VI-^pI P I-^pI/ p<q I-^p I 

= logx + O (log log x), 

which gives that the first part of (|25|) is satisfied. The second part of (|25|) is 
satisfied by applying the same argument. 

9 Proof of Proposition |5]. 

Let 

Vpir) = I ^(r,p) 





J 




p- 




Kp\ 
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The sum we wish to bound from above is as follows: 
X := Y.^\ogp)Y,J{r,pf 



P<Q 

< 



p<Q reGp ^ f \ V\ \F \ p\J / 

p<Q reJp p<Q reGp ' ^''^ 



Now, we have that 



P<Q r£(-K)p 



where 



p<Q re{-K)p P<Q '^^Jp 

= \J\\\ogx + 0{l)) 

E^ = ^{\ogp) ^ = ^{\ogp)^——— 

P<Q rej^lq; ' ^' p<Q reJp ^ ' 

= l-^l'Er^ = l-^l'Oog^ + 0(logloga:)) 

P<Q 

S3 = ki^E^ = 

Note that the upper bound we derived for Ei comes from Lemma ^, together 
with the fact that logQ = (logx)/2 + 0(loglogx); the equation for E2 comes 
from Le mma and, the switching of some of the above sums from a sum 
over r G {—K)p to r E Jp is justified since Jp C [—K)p, by ( p6l) and (|27|) . 
It follows that 

F = ^1-^2 = 0(|Jplogloga;) . 
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Substituting this into (^) gives 
which proves the Proposition. 

10 Proof of Lemma [tQ| . 

For an integer let S{h) denote the symmetric difference between [J — hk)p 
and {J -{h- l)k)p. We note that \S{h)\ = |5(0)|. 
Now, since 

Jp \ {J-hk\ C S{h) U (Jp \ (J-(/i-l)A;)p), 
it follows that 

|jp \ {j-hk),,\ < \sm + |jp \ (j-(/^-i)fc)p|. 

For /i > 1 a simple induction argument then shows that 

\Jp \ {J-hk)p\ < h\S{0)\. (30) 

Now, from and (P^) we deduce that Jp, (J + A;)p C (—K)p, which gives: 

\Jp \ {J+k)p\ < irWp \ {J + k)p\ = - \{J + k)p\ 

= P - \Kp\ - \Jp\; 

and 

\iJ + k)p \ Jp\ < ITkYp \ Jp\ = p - \Kp\ - \Jp\. 

Thus, 

\Sm = \Jp \ iJ + k)p\ + \{J + k)p \ Jpl < 2{p - \Kp\ - \Jp\). (31) 
From this and the fact that 

p - \Kp\ = \fl<Yp\ < |Jp|, 
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we deduce 



p<Q 



(p - 






jp\) 


{p 







< 



(p 










Jp\) 




Up - 




Kp\) 



P<Q ^'^ 
0(log logx), 



P- \Kp 



by Lemma ^. From this, (0), and (|3T|), we deduce 



2^(logp) : < 



P<Q 



ip - \K,\y 



2j ^(logp) 
0{j log log x) 



(p - 


Kp 




Kp\) 


(p 







which proves the Lemma. 
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